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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 



(i) APPLICANT: 



Ciossek, Thomas 
Ullrich, Axel 
Millauer, Birgit 



(ii) TITLE OF INVENTION: 



(iii) NUMBER OF SEQUENCES: 



METHODS FOR DIAGNOSIS 
AND TREATMENT OF MDK1 
SIGNAL TRANSDUCTION 
DISORDERS 



12 



(iv) CORRESPONDENCE ADDRESS: 



(A) 
(B) 

(C) 
(D) 
(E) 
(F) 



ADDRESSEE: 
STREET : 

CITY: 
STATE : 
COUNTRY : 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM- 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER- 

(B) FILING DATE: 

(C) CLASSIFICATION: 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
Word Perfect 5.1 



08/368,776 
January 3, 1995 



(vii) PRIOR APPLICATION DATA: 

Prior applications total 
including application 
described below: 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 




(ii) TITLE OF INVENTION: 



(iii) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS: 



(A) 
(B) 

(C) 
(D) 
(E) 
(F) 



ADDRESSEE : 
STREET : 

CITY: 
STATE : 
COUNTRY : 
ZIP: 



Ciossek, Thomas 
Ullrich, Axel 
Millauer, Birgit 



METHODS FOR DIAGNOSIS 
AND TREATMENT OF MDK1 
SIGNAL TRANSDUCTION 
DISORDERS 



12 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
Word Perfect 5.1 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



08/368,776 
January 3, 1995 



(vii) PRIOR APPLICATION DATA: 

Prior applications total, 
including application 
described below: 



none 
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(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 



(A) NAME: Warburg, Richard J. 

(B) REGISTRATION NUMBER: 32,327 

(C) REFERENCE/DOCKET NUMBER: 208/007 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 489-1600 

(B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



(2) INFORMATION FOR SEQUENCE ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAAdTGCA AGGAAACGTT 600 
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TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000 

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 

GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 

TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200 

ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250 

TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300 

AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350 

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400 

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450 

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 

CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550 

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600 

. CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650 . 

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700 

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750 

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800 

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900 

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950 

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000 
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AAGAAGGGGA TGAAGAACTC TACTTTCATT TTAAATTTCC AGGCACCAAA 2050 

ACCTACATTG ACCCTGAAAC CTATGAGGAC CCAAATAGAG CTGTCCATCA 2100 

ATTCGCCAAG GAGCTAGATG CCTCCTGTAT TAAAATTGAG CGTGTGATTG 2150 

GTGCAGGAGA ATTTGGAGAA GTTTGCAGTG GTCGTTTGAA ACTTCCGGGC 2200 

CAGAGAGATG TTGCAGTGGC CATAAAAACC CTGAAAGTTG GTTACACAGA 2250 

AAAGCAAAGG AGGGACTTTT TATGCGAAGC AAGCATCATG GGGCAATTTG 2300 

ACCACCCAAA TGTCGTCCAT TTGGAAGGGG TTGTTACAAG AGGGAAGCCT 2350 

GTCATGATTG TGATAGAGTT CATGGAGAAT GGAGCCCTGG ATGCATTTCT ' 2400 

CAGGAAACAC GATGGGCAGT TTACAGTCAT TCAGTTGGTA GGAATGTTGA 2450 

GAGGTATTGC CGCTGG"GATG CGATACTTGG CTGATATGGG ATACGTTCAC 2500 

AGGGACCTTG CAGCGCGCAA CATCCTTGTC AACAGCAATC TTGTTTGTAA 2550 

AGTGTCAGAT TTTGGCCTTT CCCGGGTTAT AGAGGATGAT CCCGAAGCTG 2600 

TCTACACCAC GACTGGTGGA AAAATTCCAG TAAGGTGGAC TGCACCGGAA 2650 

GCCATTCAAT ACCGGAAGTT CACCTCAGCC AGCGATGTGT GGAGCTATGG 2700 

GATTGTCATG TGGGAAGTGA TGTCTTATGG AGAAAGACCT TACTGGGACA 2750 

TGTCAAATCA AGATGTCATT AAAGCGATAG AAGAAGGTTA TCGTTTGCCG 2800 

GCGCCCATGG ■ ATTGCCCAGC TGGTCTTCAC "CAGCTAATGC TGGATTGTTG 2850 

GCAGAAAGAT CGGGCGGAAA GGCCAAAGTT TGAGCAGATA GTCGGAATTC 2900 

TAGACAAAAT GATTCGAAAC CCAAGTAGTC TGAAAACACC CCTGGGAACT 2950 

TGTAGTAGAC CCTTAAGCCC TCTTCTGGAC CAGAGCACTC CTGACTTCAC 3000 

TGCCTTCTGT TCAGTTGGAG AATGGTTGCA AGCTATTAAA ATGGAAAGGT 3050 

ATAAGGACAA CTTCACAGCA GCGGGTTACA ACTCACTCGA GTCAGTGGCC 3100 

AGGATGACTA TCGATGATGT GATGAGTTTA GGGATCACAC TGGTTGGCCA 3150 

TCAAAAGAAG . ATCATGAGCA GCATCCAGAC TATGCGGGCA CAAATGTTGC 3200 

ATTTACACGG AACAGGCATC CAAGTGTGAC ACATCGGCCT CCCTCAGATG 3250 

AGGCTTAAGA CTGCAGGAGA ACAGTTCTGG CCTTCAGTAT ACGCATAGAA 3300 

TGCTGCTAGA AGACAGTTGA TATACTGGGT CCTTCCTACA AGAAAGAGAA 3350 

GATTTTAGAA GCACCTCCAG ACTTGAACTC CTAAGTGCCA CCAGAATATA 3400 

SSSD/27372. vOl 






107 



CAAAAAGGGA ATTTAGGATC CACCACTGGT GGCCAGGAAC ACAGCAGAGA 3450 



ACCTCCTTTT TATCTTATAG ACTTTTTAAA AATGTACATA AAGAATTTAA 3550 

GAAAGAATAT ATTTGTCAAA TAAAAATCAT GATCTTATTG TTAAAATCAA 3600 

TGAAATATTT TCCTTAAAAT ATGTGATTTC AGACTATTCT TTTCCAGAAC 3650 

CATCTGTGTT TATTCTGCTT AAGGACTTTG TTTTAGAAAG TTATTTGTAG 3700 

CTTTGGACCT TTTTAGTGTT AAATTTATGA CACGTTACTA CACTGGGAAC 3750 

CTTTGAAGAC TCTCAAACTT AAAGGAAAGC AAAACTACGC ACATAGTCGA 3800 

GGATGGACTT TGTCCTTCAT GGCTTTGGTA TCCTGGCTGT GTCATTTTGT 3850 

TAAACCAGTG ATGTTTTCAT ATTGTTTGCT GATTGGCAGG TAGTTCAAAA 3900 

TTGCAAGTTG CCAAGAGCTC TGATATTTTT TAACAGGATT TTTTTTTCTT 3950 

TGTAAAAATC AGATAACATA CTAACTTTTC AATGAAAAAA AAAAAAAAAG 4000 

AAGCAATAAT GATCCATAAA TACTATAAGG CACTTTTAAC AGATTGTTTA 4050 

TAGAGTGATT TACTAGGCAG AATTTAATAA AAAAAAAAGA GAGATGTCAA .4100 

ATTTTAGGTT TATGTGTATA TGATAAAAGG CTGAGCTTCG TCTGAAGATG 4150 

CTGGTGAAAG CAAGACTGGA AGCGAAGCTC TCCAGCTTTG GCTAACCCAA 4200 

TCCGAGCACA TCAAGAGCTT CAGTCTTGTG ACAGTAAGAA ATTTAGGAAC 4250 

ATAGTTGACC TATATTTTGT ATTCTTTCTT GTTGAATGCA GTCCAAATAC 4300 

AAAA 4304 

(2) INFORMATION FOR SEQUENCE ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 



CAATAAACAA 



AGTACTACCT 



GAAAAACATC 



CCAACACCTT GAGCTCTCGA 



3500 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDNESS: 

(D) TOPOLOGY: 



998 amino acids 
amino acid 
single 
linear 



(ii) 



MOLECULE TYPE: 



peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 . 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 * 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg lie Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
.145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

lie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr lie 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 * 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cvs Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 
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Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Ntet Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 
485 490 495 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 
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Val Ala Val 



He lie Gly 



Asp Glu Glu 
595 

He Asp Pro 
610 



Ala Gly Thr 
565 

Arg Arg His 
580 

Leu Tyr Phe 
Glu Thr Tyr 



He He Leu Val Phe Met 
570 

Cys Gly Tyr Ser Lys Ala 
585 

His Phe Lys Phe Pro Gly 
600 

Glu Asp Pro Asn Arg Ala 
615 620 



Val Phe Gly Phe 
575 

Asp Gin Glu Gly 
590 

Thr Lys Thr Tyr 
605 

Val His Gin Phe 



Ala Lys Glu 
625 

Ala Gly Glu 



Gin Arg Asp 



Glu Lys Gin 
675 

Phe Asp His 
690 

Lys Pro Val 
705 

Ala Phe Leu 



Gly Met Leu 



Gly Tyr Val 
755 

Asn Leu Val 
770 



Leu Asp Ala 
630 

Phe Gly Glu 
645 

Val Ala Val 
660 

Arg Arg Asp 

Pro Asn Val 

Met He Val 
710 

Arg Lys His 
725 

Arg Gly He 
740 

His Arg Asp 
Cys Lys Val 



Ser Cys He Lys 



Val Cys Ser Gly 
650 

Ala He Lys Thr 
665 

Phe Leu Cys Glu 
680 

Val His Leu Glu 
695 

He Glu Phe Met 



lie. Glu 
635 

Arg Leu 
Leu Lys 
Ala Ser 



Gly Val 
700 

Glu Asn 
715 



Asp Gly Gin Phe Thr Val 
730 

Ala Ala Gly Met Arg Tyr 
745 

Leu Ala Ala Arg Asn He 
760 

Ser Asp Phe Gly Leu Ser 
775 780 



Arg Val lie Gly 
640 

Lys Leu Pro Gly 
655 

Val Gly Tyr Thr 
670 

He Met Gly Gin 
685 

Val Thr Arg Gly 



Gly Ala Leu Asp 
720 

He Gin Leu Val 
735 

Leu Ala Asp Met 
750 

Leu Val Asn Ser 
765 

Arg Val He Glu 



Asp Asp Pro 
785 

Arg Trp Thr 
Ser Asp Val 



Gly Glu Arg 
835 



Glu Ala Val 
790 

Ala Pro Glu 
805 

Trp Ser Tyr 
820 

Pro Tyr Trp 



Tyr Thr Thr Thr 



Ala He Gin Tyr 
810 

Gly He Val Met 
825 

Asp Met Ser Asn 
840 



Gly Gly Lys He Pro Val 
795 800 

Arg Lys Phe Thr Ser Ala 
815 

Trp Glu Val Met Ser Tyr 
830 

Gin Asp Val He Lys Ala 
845 
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He Glu Glu Gly 
850 

Leu His Gin Leu 
865 

Pro Lys Phe Glu 



Pro Ser Ser Leu 
900 

Pro Leu Leu Asp 
915 

Gly Glu Trp Leu 
930 

Thr Ala Ala Gly 
945 

Asp Asp Val Met 



lie Met Ser Ser 
980 

Gly Thr Gly He 
995 



Tyr Arg Leu Pro 
855 

Met Leu Asp Cys 
870 

Gin He Val Gly 
885 

Lys Thr Pro Leu 



Gin Ser Thr Pro 
920 

Gin Ala He Lys 
935 

Tyr Asn Ser Leu 
950 

Ser Leu Gly He 
965 

lie Gin Thr Met 



Gin Val 
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Ala Pro Met Asp 
860 

Trp Gin Lys Asp 
875 

He Leu Asp Lys 
890 

Gly Thr Cys Ser 
905 

Asp Phe Thr Ala 



Met Glu Arg Tyr 
940 

Glu Ser Val Ala 
955 

Thr Leu Val Gly 
970 

Arg Ala Gin Met 

985 



Cys Pro Ala Gly 



Arg Ala Glu Arg 
880 

Met He Arg Asn 
895 

Arg Pro Leu Ser 
910 

Phe Cys Ser Val 
925 

Lys Asp Asn Phe 



Arg Met Thr lie 
960 

His Gin Lys Lys 
975 

Leu His Leu His 
990 



(2) INFORMATION FOR SEQUENCE ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) - TOPOLOGY : ; linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp He He Leu Cys Tyr He 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp He 
35 40 45 
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Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu He Gly Pro 
165 170 175 

• Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

He Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin . Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 
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Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val . Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr . Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 
485 490 495 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 . 535 540 

Ala Thr Ala Val Ser Ser * Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 

565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Ser Leu Val Thr Asn Glu His Leu Ser 
595 600 v 605 

Val Leu 
610 
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(2) INFORMATION FOR SEQUENCE ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 3 00 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000 

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 
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GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 

TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200 

ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250 

TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300 

AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350 

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400 

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450 

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 

CTTAAGCAGA TCCCAGAGGC TCTTC0CTGC TGTTAGCATC ACCACCGGTC 1550 

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600 

CGGAGTG.TGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650 

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT ■ 1700 

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA. TAATCTGAAA .1750 

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800 

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900 

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950 

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000 

AAGAAGGGGA TGAAGAACTC TACTTTCATT CTTTAGTAAC AAATGAGCAC 2050 

CTGTCAGTTT TATAAACCGC AACAATAACT GTTTAAGACA ATCAATTTTG 2100 

GATAAACAAT CAACTACAGC AGAATAAATC AAGATTTTTA AGTCCCATTT 2150 

TCCTTTATAC ATTCTGCTTA TTTTGTTGTT ATATGTTTAT TTTTTAAACT 2200 

CTGATCTTGA TTGAATGTGA TACCATAAGC ACAGTTAGGC TGCAGTGTAA 2250 

ATATATAAAG ACATTGTTCT GAGAGCAGTA CGATTTCATG GAAAGATTGT -2300 

TTGGTGGCTT TGTTAAAATT AATAAAGAAT TTTTAAGGAT ATAGTGTAAT 2350 

TTTCTTCATT GCATTAATAT AACCAAATAT GCCTACCTAT CTTTGTCTTG 2400 

AACCAAATGA ATAGATTTGG AATACTTTAT TGTAATTGAA TTTGATATAA 2450 
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AGTTGACTGA 


GCATTTATGT 


GTTACCTGCA 


TGCTTCTGGG 


TGCATTGAAA 


2500 


TATTTTAACT 


TTTAAAATGA 


TACTATGTTG 


TTTCAATTTT 


GACTACCTTT 


2550 


TGTGAGGCAT 


ACTGGCTACC 


TCCTCCTATT 


AGCTAAGATC 


TTCCAAAGCC 


2600 


TTATAATGAA 


AAGTTTATAT 


AAACCATTTC 


TCTTTCAAAT 


CACTGTCATA 


2650 


CTTGGTCACG 


GATCCCAGGA 


ATATTGTAAA 


TTTTCTAATT 


TACTCTGCAC 


2700 


TTTGTATATC 


CAGCCTCTAT 


TACCCTCAAG 


GTGAATATAA 


AACTATGTCT 


2750 


XXX VJxTXTk X r\ X X 


1^<1U1 X X w^l X 


TTTGTGATAG 


CAGTCCCTCA 


X *i X W X X \J X 


O w VJ 


TAATTTTATG 


TATATGTCAA 


CAGTGGTTGG 


TCTTTAAAAA 


TAAATCAAAG 


2850 


AATAAGTAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAATAAAA 


AAAAAAAAAA 


2900 


A 










2901 



(2) INFORMATION FOR SEQUENCE ID NO : 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg lie Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 
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Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 , 135 140 

Lys lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

lie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr lie 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 
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Thr Val Met Asp 



Ala Val 

Ala Val 

Val Met 
450 

Glu Pro 
465 

Tyr Glu 

Ser Thr 

Phe Gin 

Arg Leu 
530 

Ala Thr 
545 

Val Ala 
He He 
Asp Glu 



Met Glu 
610 

Arg Leu 
625 



Asn Gly 
420 

Ser He 
435 



Leu Leu 
405 

Val Ser 
Thr Thr 
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Ala His Ala 



Asn Tyr Thr 
410 



Phe Glu Val Glu 
415 



Lys Glu Arg Val 
Glu His 
Lys Asp 



Pro Asn 
470 



Ser Ala 
500 

He Arg 
515 



Gin Arg 
485 

Ser He 
Ala Val 



Asp Val Ala Thr 
Ala Val 
Val Ala 



Ser Ser 
550 



Asp Leu Ser 
425 

Gly Gin Ala 
440 

Leu Gin Arg 
455 

Gly Val He 



Glu Arg Thr 



Asn Asn Leu 
505 

Thr Ala Ala 
520 • 

Leu Glu Glu 
535 

Glu Gin Asn 



Arg Ser Gin 



Ala Pro Ser 



Arg Leu Phe Ala 
430 

Gin Val Ser Gly 
445 



Gly Thr 
565 



Gly Arg 
580 



Arg His 
Tyr Phe 
Lys Thr Gin His 



Glu Leu 
595 



He He Leu 



Cys Gly Tyr 
585 

His Ser Leu 
600 

Asn Lys Lys 
615 



Ser Val Gin 
460 

Thr Glu Tyr 
475 

Tyr Ser Thr 
490 

Lys Pro Gly 

Gly Tyr Gly 

Ala Ser Gly 
540 

Pro Val He 
555 

Val Phe Met 
570 

Ser Lys Ala 
Tyr Arg Glu 



Trp Met He 
620 



Leu Ser Trp Gin 



Glu He Lys Tyr 
480 

Leu Lys Thr Lys 
495 

Thr Val Tyr Val 
510 

Asn Tyr Ser Pro 
525 

Lys Met Phe Glu 



He He Ala Val 
560 

Val Phe Gly Phe 
575 

Asp Gin Glu Gly 
590 

Arg Gly Asp Gly 
605 

Ala Ser Cys Ser 



(2) INFORMATION FOR SEQUENCE ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDNESS: 

(D) TOPOLOGY: 



2323 base pairs 
nucleic acid 
single 
linear 
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(ii) 



MOLECULE TYPE: 



nucleic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 
CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 
GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 
TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 
CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 
TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 
CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 
CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 
GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 
GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 
TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 
AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 
TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 
AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 
CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 
GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 
GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 
ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 
ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 
AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGG AGAGTGG . 
CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 
GGACACTTGC GAACCCTGTG - GCCGCAGGTT CTACAAATCT TCCTCTCAGG 
ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 
TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 
ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 
TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 
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50 
100 
150 
200 
250 
300. 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
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AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 13 50 

TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400 

CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 14 50 

GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 

CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550 

AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600 

CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650 

CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700 

ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750 

CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800 

TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 

GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900 

ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950 

GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000 

AAGAAGGGGA TGAAGAACTC TACTTTCATT CTCTTTACAG GGAAAGGGGA 2050 

GACGGGATGG AAAAGACACA GCACAATAAG AAGTGGATGA TTGCATCGTG 2100 

CTCTCGTTTG TAGGTCTCTT TTCCTAATCA ACACTATGAT TTTGAAGTAC 2150 

GCGTACACGA -AGCAAACGGG AAGAGATAAG GAATTAGCAT TGTGAACCTG 2200 

ACTGTAATCC TCTCTTCCGG AAAGAGATGA GATGCTATTG CGATGAGAAT 2250 

GTACAACTTG CACCTTGAAA TCTTTTTTGA TAATTAGTGC TCAGGGGAGG 2300 

GGGGGGGAAG TAGAGAAAGC AAA 2323 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS 

(D) TOPOLOGY: 



6 amino acids 
amino acid 
single 
linear 



(ii) 



MOLECULE TYPE: 



peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Ala Ala Thr Ala Ala Ala 

5 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(xi) SEQUENCE DESCRIPTION: 

Ala Ala Thr Ala Ala Ala 

5 



6 amino acids 
amino acid 
single 
linear 

peptide 

SEQ ID NO: 8: 



(2) INFORMATION FOR SEQ ID NO : 9: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



6 amino acids 
amino acid 
single 
linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

His Arg Asp Leu Ala Ala 
5 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 



6 amino acids 
amino acid 
single 
linear 

peptide 
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(ix) FEATURE: 

(D) OTHER INFORMATION: 



Xaa in position 2 is valine 
or methionine; Xaa in 
position 5 is phenylalanine 
or tyrosine. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Xaa Trp Ser Xaa Gly 
5 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu He Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 
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Lys lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

lie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr lie 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 . . 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 
325 330 335 

Asn Leu lie Phe Asn lie Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 .375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 
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Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 
485 490 495 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser' Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Ala Thr Ala Val Ser 
530 535 540 

Ser Glu Gin Asn Pro Val He He He Ala Val Val Ala Val Ala Gly 
545 550 555 560 

Thr He lie Leu Val Phe Met Val Phe Gly Phe lie lie Gly Arg Arg 
565 570 575 

His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly Asp Glu Glu Leu Tyr 
580 585 590 

Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr lie Asp Pro Glu Thr 
595 600 605 

Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe Ala Lys Glu Leu Asp 
610 615 620 

Ala Ser Cys lie Lys He Glu Arg Val He Gly Ala Gly Glu Phe Gly 
625 630 635 6'40 

Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gin Arg Asp Val Ala 
645 650 655 

Val Ala lie Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gin Arg Arg 
660 665 670 

Asp Phe Leu Cys Glu Ala Ser lie Met Gly Gin Phe Asp His Pro Asn 
675 680 685 

Val Val His Leu Glu Gly Val Val Thr Arg Gly Lys Pro Val Met He 
690 695 700 

Val He Glu Phe Met Glu Asn Gly Ala ILeu Asp Ala Phe Leu Arg Lys 
705 710 715 720 



SSSD/27372 



vOl 






125 



His 



Asp Gly Gin Phe Thr Val He Gin 
725 



Leu 
730 



Val 



Gly Met 



Leu 



Arg Gly 
735 



He Ala Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His Arg 
740 745 750 

Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val Cys Lys 

755 * 760 765 

Val Ser Asp' Phe Gly Leu Ser Arg Val He Glu Asp Asp Pro Glu Ala 
770 775 780 

Val Tyr Thr Thr Thr Gly Gly Lys He Pro Val Arg Trp Thr Ala Pro 
785 790 795 800 

Glu Ala He Gin Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser 
805 810 815 

Tyr Gly He Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr 
820 825 830 

Trp Asp Met Ser Asn Gin Asp Val He Lys Ala He Glu Glu Gly Tyr 
835 840 845 

Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gin Leu Met 
850 855 860 

Leu Asp Cys Trp Gin Lys Asp Arg Ala Glu Arg Pro Lys Phe Glu Gin 
865 870 875 880 

He Val Gly He Leu Asp Lys Met He Arg Asn Pro Ser Ser Leu Lys 
885 890 895 

Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser Pro Leu Leu Asp Gin 
900 905 910 

Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val Gly Glu Trp Leu Gin 
915 920 925 

Ala He Lys Met Glu Arg Tyr Lys Asp Asn Phe Thr Ala Ala Gly Tyr 
930 935 940 

Asn Ser Leu Glu Ser Val Ala Arg Met Thr He Asp Asp Val Met Ser 
945 950 955 960 

Leu Gly He Thr Leu Val Gly His Gin Lys Lys He Met Ser Ser He 
965 970 975 

Gin Thr Met Arg Ala Gin Met Leu His Leu His Gly Thr Gly He Gin 
980 985 990 

Val 
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(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
15 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 
85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn He Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 
165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 190 

He Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 
195 200 205 
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Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

Leu Val Pro He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp. Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 



Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg He Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 375 380 

Asn He Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 390 395 400 

Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser. Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 



325 



330 



335 



485 



490 



495 
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Ser Thr Ser Ala Ser lie Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin lie Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 

565 570 575 

He lie Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Cys Thr Lys Thr Tyr He Asp Pro Glu 
595 600 605 

Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe Ala Lys Glu Leu 
610 615 620 

Asp Ala Ser Cys He Lys He Glu Arg Val lie Gly Ala Gly Glu E>he 
625 630 635 ' 640 

Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gin Arg Asp Val 

645 650 655 

Ala Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gin Arg 
660 665 670 

Arg Asp Phe Leu Cys Glu Ala Ser lie Met Gly Gin Phe Asp His Pro 
675 680 685 

Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly Lys Pro Val Met 
690 695 700 

lie Val He Glu Phe Met Glu Asn Gly Ala Leu Asp Ala Phe Leu Arg 
705 710 715 720 

Lys His Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Arg 

725 730 735 

Gly lie Ala Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His 
740 745 750 

Arg Asp Leu Ala Ala Arg Asn lie Leu Val Asn Ser Asn Leu Val Cys 
755 '760 765 

Lys Val Ser Asp Phe Gly Leu Ser Arg Val He Glu Asp Asp Pro Glu 
770 775 780 
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Ala Val Tyr 
785 



Thr Thr Thr Gly Gly Lys 
790 



Pro Glu Ala lie Gin Tyr Arg Lys Phe 
805 

Ser Tyr Gly He Val Met Trp Glu Val 
820 825 



Tyr Trp Asp 
835 

Tyr Arg Leu 
850 

Met Leu Asp 
865 



Met Ser Asn Gin Asp Val 
840 

Pro Ala Pro Met Asp Cys 
855 

Cys Trp Gin Lys Asp Arg 
870 



Gin He Val Gly He Leu Asp Lys' Met 
885 

Lys Thr Pro Leu Gly Thr Cys Ser Arg 

900 905 



Gin Ser Thr 
915 

Gin Ala He 
930 

Tyr Asn Ser 
945 

Ser Leu Gly 
He Gin Thr 
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